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ABSTRACT 

We present the first direct calibration of strong-line metallicity diagnostics at significant cosmological 
distances using a sample at z ~ 0.8 drawn from the DEEP2 Galaxy Redshift Survey. Oxygen and neon 
abundances are derived from measurements of electron temperature and density. We directly compare 
various commonly used relations between gas-phase metallicity and strong line ratios of O, Ne, and H at 
z ~ 0.8 and 2 = 0. There is no evolution with redshift at high precision (AlogO/H = —0.01 ± 0.03, 
AlogNe/O = 0.01 ± 0.01). O, Ne, and H line ratios follow the same locus at 2 ~ 0.8 as at 2 = 0 with 
<0.02 dex evolution and low scatter (<0.04 dex). This suggests little or no evolution in physical conditions 
of H II regions at fixed oxygen abundance, in contrast to models which invoke more extreme properties 
at high redshifts. We speculate that offsets observed in the [N n]/Ha versus [O ill]/H/3 diagram at high 
redshift are therefore due to [N n] emission, likely as a result of relatively high N/O abundance. If this is 
indeed the case, then nitrogen-based metallicity diagnostics suffer from systematic errors at high redshift. 
Our findings indicate that locally calibrated abundance diagnostics based on a-capture elements can be 
reliably applied at 2 ~ 1 and possibly at much higher redshifts. This constitutes the first firm basis for the 
widespread use of empirical calibrations in high redshift metallicity studies. 

Subject headings: galaxies: evolution — galaxies: ISM 


1. INTRODUCTION 

Gas-phase oxygen abundance (hereafter “metallicity”) is a 
valuable yet elusive diagnostic of galaxy evolution. Metal¬ 
licity is driven by the production of heavy elements via star 


(e.g., Tinsley & Larson 

1978; Edmunds 19901 

Erb 

2008 

Finlator & Dave 

2008 

J eeples & Shankar 

2011 

). Precise 


pecially in combination with accumulated stellar mass and 
gas content, therefore provide information on the history of 
gas flows which regulate galaxy mass growth. This prospect 
has motivated vast efforts to characterize galaxy metallic¬ 


al. 

20 

L3 

al. 

20 

15 


ity as a function of stellar mass and cosmic time (Erb et al. 
2006] Maiolino et al.||2008| |Mannucci et aLl|2(JIL)| |Richard 
et al. I 20111 IBelii et ai. 1120131 IHenry et al.||2013| IZahid et 

| Cullen et alT||2014| |wi - 


'uyts et al. || 2 6 14 


and many others), and to interpret t 


s 


Banders et 

re resultant 


motivated accretion and feedback (e.g., 

Brooks et al. 

2007 

Finlator & Dave 2008; Dave et al. 2011 

Brook et al. 

2012 

Romeo Velona et al. 2013] Obreja et ai. 

2014). Observa- 


that galaxies with lowe r stellar masses have l ower metal 


lici ty on average (e.g., Tremonti et al. 2004 Lequeux et 


al. 1979). This is commonly attributed to metal-enriched 


outflows which are more efficient at removing gas from low- 
mass systems. The mass-metallicity relation evolves to ward 


lower metallicity at high er redshifts (e.g., Maiolino et al. 


2008 Zahid et al. 20131, roughly commensurate wit h the 


increased gas fractions measured for modest samples (Tac 


coni et al. 2013). 
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Although progress towards quantifying the chemical evolu¬ 
tion of galaxies over cosmic time is seemingly impressive, 
essentially all of the data-driven results described above 
rely on indirect estimates of metallicity and should be re¬ 
garded with due skepticism. True measurements of nebu¬ 
lar ionic abundances require observations of either recom¬ 
bination lines, or collisionally excited emission fines com- 


bined with knowlec 

ge of the tern] 

rerature and density (e.g., 

Aller & Liller|1959 

Tsamis et al. 

2003 Stasinskaj 2004 

), al- 

though we caution 

that these methods are still sensitive to 


assum ptions s uch as the electron energy distribution (e.g., 


Nicholls et al. 2012). This presents an observational chal¬ 
lenge in that temperature-sensitive and recombination lines 
of heavy elements are fainter than the Balmer recombina¬ 
tion lines of hydrogen by factors of >100 and >1000, respec¬ 
tively. This has motivated the development of more practi¬ 
cal - although more uncertain - methods based on the flux 
ratios of Balmer lines and comparably bright collisionally 
excited met al lines. These “str ong line” methods, firs t in- 

, ex- 


Jensen et al. ( |1976 ) and Pagel et al. (1979), 


troduced by 

ploit the fact that various flux ratios are strongly correlated 
with direct measurements of metallicity. Relations between 
strong line ratios and metallicity can be calibrated either 
empirically from direct measurements of bright nearby ob- 
.jects, or through detailed photoionization modeling (e.g., 
Kewley & Dopita 2002 Dopita et al. 2013). Various such 
calibrations are now in widespread use and are virtually the 
only method used to estimate galaxy metallicities at high 
redshift. 

Strong line methods can in principle be used to infer metal¬ 
licities with an accuracy of <r(logO/H) ~ 0.1-0.2 dex (al¬ 
though uncertainty va ries depending on the line ratio and 


i oniza tion regime; e.g., |Pet tini fe Pagel|2004| 
20081. This uncertainty floor is limited by ii 


Maiolino et al.] 

by intrinsic scatter 
in metallicity at fixed strong line ratios, which arises from 
variations in physical properties such as the ionization pa- 
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rameter. In practice, however, there is no clear consensus 
on the absolute metallicity scale: different published cali - 


brations differ by up to 0.7 dex (Kewley & Ellison 


2008)! 


Nonetheless these discrepancies have not curtailed efforts to 
quantify relative metallicity evolution of galaxies up to red- 
shifts z > 3 using consistent sets of calibrations. Another 
concern is whether strong line methods can be reliably ap¬ 
plied at high redshifts at all, given that existing calibra¬ 
tions are based entirely on local galaxies and H II regions. 
If the physical state of H II regions evolves systematically 
with redshift, this will induce artificial evolutionary trends 
in metalli cities inferred from locally-calibrated diagnostics 
(e.g., Kewley et al.||2013 ). 


Large spectroscopic surveys have now conclusively shown 
that emission line ratios of high redshift galaxies (z > 1) are 
significantly offset from the locus formed by local galaxies. 
This is at l east true for the “BPT di agram” ( named for the 
authors Baldwin, Phillips, & Terlevich 1981) of [O iii]/H/3 
versus [N Ii| /Ha tor w hich the typical offset at z ~ 2.3 is 
0.2-0.4 dex ( Shapley et al.|2015 Steidel et al.|2014 ). Other 
diagrams involving oniy oxygen, hydrogen, and suifur lines 
do not show significant offsets, however. Since [N ii]/Ha 
and 03N2 = ([O iii]/H/J)/([N n]/Ha) are among the most 
commonly us ed strong-line metallicity indicators at high 
redshift (e. g.,"' 
etalp015l ), t 


Erb et al. 2006| 
leir intrinsic evo 


Steidel et al. 2014 Sanders 
ution necessitates a quanti- 


tative revision of many previous results. Several possib le 
causes for this evolution have been proposed (e.g., Kewley 


et al. 


et al. 


2013) and there is some evidence that elevated^elec¬ 


tron densit y may be at least pa rtially responsible (Hainline 


2009 Shirazi et al. 20141. Increased stellar effective 


temperatur es can atso explain the evolution (as considered 
in detail by Steidel et al.120141, although this should result 


in similar offs ets in |S II [/Ha versu s [O Ill]/H/3, which are 


not observed (Shapley et al. 2015). In some cases the off¬ 
sets are explaine d by a com bination of star formation and 
AGN excitation (Wright et al.||2010| |Newman et al. ||2014|) . 
However, integral field spectroscopic observations of lensed 
galaxies show that these offsets are present even for indi¬ 
vidual giant H II regions (|Jones et al.||2010| |2013|) . At the 
very least it is clear that metallicities interred from these 
diagnostics are not consistent with local sam ples nor even 
internally among high redshift samples (e.g., 

20141. 


Steidel et al. 


Our study is motivated primarily by measurements of 
strong evolution in the BPT line ratio diagram and the 
associated unknown astrophysics. Ultimately we wish to 
understand the physical origin of evolution in nebular emis¬ 
sion spectra, and to determine which (if any) strong line 
metallicity diagnostics can be calibrated for reliable use 
at high redshifts. These goals require accurate measure¬ 
ments of the physical properties of high redshift galaxies, 
particularly metallicity. In this paper we analyze a sample 
of galaxies at z ~ 0.8 for which we obtain sensitive mea¬ 
surements of electron density and temperature from the 
nebular emission spectra. This provides direct metallicities 
which we compare with local galaxies analyzed in exactly 
the same manner. Our analysis relies principally on the 
temperature-sensitive [O iii]A 4363/[0 ill]AA4959,5007 flux 
ratio. Measurements of the weak [O ill] A4363 line have pre¬ 
viously been repo rted for modest samples of intermediate- 


redshift 

galaxies ( 

Hoyos et al. 

2005^ 

Kakazu et al. 

2007 

Ly et al. 

2014a|b| 

Amorin et" al. 

2014) 

as well as a handiu 

at z > 1 

(Yuan & Kewley] 

200 

Christensen et al. 

2012 


Brammer et al.|2012bl |Stark et al.||2013| |James et al.||2014 

Maseda et al.||2014| ), but no effort has yet been attemptec 

to calibrate strong line metallicity diagnostics beyond the 
local universe. This paper represents a significant first step 
and we hope that our work can soon be extended with well- 
defined samples at higher redshifts. 


This paper is structured as follows. Section [2] presents our 
sample selection and a brief discussion of sample bias mit¬ 
igation. Measurements of physical properties (nebular red¬ 
dening, electron temperature, electron density, and metal¬ 
licity) are described in Section [3] Strong line metallicity 
calibrations are calculated in Section [4] Section [5] quantita¬ 
tively compares the (lack of) redshift evolution in relations 
between observed emission line ratios and derived physi¬ 
cal properties; this section constitutes the primary results 
of our work. We discuss some interesting implications and 
briefly summarize our findings in Sections [6] and [ 7 ] respec¬ 
tively. Throughout the paper we make use of the following 
notation conventions: unless stated otherwise, [O ii], [O ill], 
and [Ne in] refer to [O n]AA3727,3729, [O hi] A5007, and 
[Ne m]A3869 respectively. Expressions such as [O m]/[0 n] 
refer to the emi ssion line flux ratios. All magnitudes are in 
the AB system (Oke 19741. 


2. SAMPLE SELECTION AND ANALYSIS 


We have carefully selected a sample of distant gal axies 


from the DEEP2 Galaxy Redshift Survey (DEEP2; Davis 


et al. 2003 Newman et al. 2013) which are suitable tor 


temperature-based metallicffy analysis. DEEP2 obtained 
redshifts for >50,000 galaxies at up to z < 1.4 using the 
DEIMOS spectrograph on the Keck II telescope. The spec¬ 
tra typically span a wavelength range of ~6500-9100 A 
with resolution R ~ 5000. All spectra are corrected for 
telluric absorption and system throughput using methods 
developed by the DEEP2 team (Cooper et al. 20121, and 
corrected for M ilk y Way extinc tion using the dust map of 


Schlegel, Finkbeiner, & Davis (1998). We examined all 
galaxy spectra with secure"redshifts (DEEP2 quality code 
Q = 3 or 4) such that the wavelength coverage spans at least 


3626-4960 A in the rest frame, thereby including strong 
emission lines of [O ii] and [O ill]. For initial selection, we 
fit single Gaussian profiles to [O III] A A4959,5007 and require 
a combined signal-to-noise >80 (or >20 in [O III] A4959 for 
cases where the 5007 line is not covered). This yields 254 
spectra with redshifts 0.72 < £ < 0.87. After removing du¬ 
plicate observations, spectra where a crucial emission line 
(either Hy or [O ill]A4363) fell in the DEIMOS chip gap, 
and spectra strongly affected by telluric absorption residu¬ 
als, this sample is reduced to 196 galaxies. We identify 10 
of these as containing an active galactic nucleus (AGN) on 
the basis of broad emission lines. 


For each spectrum we measure the flux and uncertainty 
of various emission lines using Gaussian profile fits. The 
lines of interest for this paper are [O ill]AA4959,5007, 
[O iii]A4363, [Ne iii]A3869, [O n]AA3727,3729, H/3, Hy, and 
HA All lines are fit with a single component except for the 
[O Ii] doublet, which we fit with a double Gaussian function 
using the total flux and doublet ratio as free parameters. In 
all cases we fix the redshift and line width to the values de¬ 
rived at high signal-to-noise from [O iii]AA4959,5007. This 
maximizes the precision of flux measurements for fainter 
lines. In the ~30% of cases where [O ill] A5007 is not cov¬ 
ered, we adopt a flux ratio [O hi] A5007/[0 iii]A4959 = 3. 
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The hydrogen Balmer lines are affected by underlying stel¬ 
lar absorption, such that emission fluxes measured from a 
single-component fit will underestimate the true value. In 
general the stellar continuum is not strong enough to ro¬ 
bustly measure Balmer absorption profiles for individual 
galaxies. We therefore constructed a composite from the 
median of all spectra, normalized to the continuum flux 


at rest frame 4150-4300 A. We fit the composite spectrum 
with a linear combination of simp le stellar populati on tem¬ 
plates from the library of Gonzalez Delgado et ah dl999l), 
usin g the penalized pixel fitting method of Cappellari 


Emsellem (2004). The resulting average stellar spectrum is 
scaled to the continuum level of each galaxy in our sample 
and subtracted, and we re-fit the Balmer lines to deter¬ 
mine the level of stellar absorption. The median correction 


for stellar absorption is 1 . 1 - 1.2 A in rest frame equivalent 
width for H/3, Hq, and Hd. We correct the flux of each 
object for this absorption and include a conservative uncer¬ 


tainty of ±0.6 A (rest frame) to account for sample vari¬ 
ance. This additional uncertainty is propagated throughout 
the analysis. The stellar absorption correction is typically 
small (median ^6% in Hd and ~2% in H/?) due to the large 
Balmer emission equivalent widths. 


2.1. The T e sample 


The goals of this work require a uniformly selected sam¬ 
ple with accurate measurements of electron temperature 
and density from nebular emission line ratios. The par¬ 
ent DEEP2 sample of 186 star forming galaxies spans a 
range of emission line strengths, with more than an or¬ 
der of magnitude variation in the detection significance of 
[O III]AA4959,5007. Consequently there is a wide range 
in measurement precision for the temperature-sensitive 
ratio Rroi„] = ([O iii]A4363)/([0 iii]AA4959,5007), il¬ 
lustrated in Figure jTj We find a median flux ratio 
[O iii]A5007/[0 iii]A4363 ~ 100 (Figure 0. In the remain¬ 
der of this paper we therefore predominantly analyze the 
sub-sample for which [O iii]A5007 flux is >300 times larger 
than the uncertainty in [O III]A4363 flux (i.e., the ratio 
R[o hi] is measured with la precision <0.0025), which we 
refer to as the “T e sample.” This ensures a >3er detection of 
[O ill] A4363 in most cases, although the actual significance 
is subject to both noise fluctuations and intrinsic physical 
variation. Ultimately this selection yields a sample of 32 
galaxies with a median [O ill]A4363 significance of 5.3er. 
Example spectra spanning the range of [O ill] A4363 fluxes 
are shown in Figure [2] The following analysis is based on 
this sample of 32 galaxies unless stated otherwise. 


To place our sample in the context of the general galaxy 
population, we show a color-magnitude diagram in Figure[3] 
The T e sample lies in the star-forming “blue cloud” with 
representative luminosities and relatively blue colors com¬ 
pared to the population of sta r-formin g g alaxies at z ~ 0.8. 
We refer interested readers to Ly et al. (2014b) for a more 
detailed discussion of stellar population properties and their 
relation to metallicity, which is beyond the scope of this pa¬ 
per. The T r sample is simi lar to the 28 DEEP2 galaxies 
analyzed by Ly et al. (2014b), and indeed our samp l es over¬ 


lap by ^50%! The differences are that Ly et al. (2014b I 


require a >3er detection of [O lll]A4363 and coverage of 
[O ill] A5007, and do not impose a restriction on Rj 0 m ] 
precision. In terms of the Ly et al. (2014b) sample, those 
galaxies which also appear in the T e sample have represen¬ 



Fig. 1.— Sample selection and bias mitigation. Small black points 
show the parent sample of 186 star forming galaxies drawn from the 
DEEP2 survey. The data span a wide range of temperature-sensitive 
[O ill] flux ratios. The sample median and 2cr statistical deviation are 
shown by dashed and dotted lines, respectively, demonstrating that 
scatter in [O ill] ratios is largely (though not entirely) due to noise. 
To reduce the effects of random noise, we select a sub-sample above 
a limiting sensitivity in the [O ill] line ratio, shown by the larger 
blue points. We also show the sub-sample for which [O ill] A4363 
is detected at > 3cr significance (open squares). Selecting based on 
[O III] A4363 significance would clearly induce a strong bias toward 
higher R[q ml (and therefore higher temperature and lower metallic¬ 
ity). Our sensitivity-based selection eliminates this bias and ensures 
a representative distribution of derived metallicities. 

tative stellar masses (~10 8 — 10 9 M 0 ) and relatively high 
metallicities, reflecting the different selection criteria. 


2.2. Local comparison sample 

We require an appropriate z = 0 reference sample to ana¬ 
lyze the degree of evolution in physica l properties. For th is 
purpose we use the measurements of Izotov et al. (2006), 
whose galaxy sample is selected from Data Relea se 3 of the 
Sloan Digital Sky Survey (SDSS; |Abazajian et al. 20051 
based on H [3 flux (> 10 14 ergs s _1 cm -2 ) and lack of nu¬ 
clear activity. While the selection is not identical to the 
DEEP2 sample, it is similarly based on strong emission line 
fluxes and, crucially, does not explicitly select for detection 
of [O ill]A4363. We analyze only the 113 spectra (at me¬ 
dian z = 0.05) with coverage of [O ii]AA 3727,3729 and with 
[O ill] A5007 fluxes >300 times larger than the uncertainty 
in [O ill] A4363 flux. This ensures good precision a nd con¬ 
sisten cy with the z — 0.8 sample, although the full j lzotov 
et al. (2006) sample gives consistent results. 


For purposes of comparing strong emission line ratios, we 
also utilize the MPA-JHU catalog 8 of s pectra l measure¬ 
ment s from Data Release 7 of the SDSS (Abazajian et al. 


2009). We identify star forming galaxies according to their 


[IN il]/H a and [O llll/H/3 flux ratio s using the classification 
scheme of Kauffmann et al. (2003) and select a sub-sample 
for which the following relevant lines are all detected at 
>10cr significance: [O ii]AA3727,3729, [Ne iii]A3869, HJ, 


3 http://www.mpa-garching.mpg.de/SDSS/DR7/ 
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[Oil] [NeIII] H5 Hy [OIII] Hp [OIII] 




3800 4000 4200 4400 4600 4800 5000 
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Fig. 2.— Spectra from the T e sample with the highest (top), me¬ 
dian (middle), and lowest (bottom) formal signal-to-noise ratio in 
[O III] A4363. These examples are chosen to show the range of data 
quality. Various salient characteristics are apparent such as the large 
Balmer emission equivalent widths, high [O ill]/H/5 flux ratios, and 
well-resolved [O n] AA3727,3729 doublet ratios indicative of low elec¬ 
tron densities (< 200 cm -3 ). 


H 7 , H/3, Ha, [N n]A6584, and [O iii]A 5007. This yields 
~20,000 spectra. Low-level AGN activity is increasingly 
difficult to diagnose for flux ratios below [O m]/H /3 < 1, 
however the spectra analyzed in this work have [O ill]/H^ > 
2 in all cases. The SDSS sample shown e.g. in Figure [4| is 
therefore representative of star forming galaxies in the local 
volume for the relevant range of emission line ratios. 


5 


4 


3 


1 


0 


-1 


Fig. 3. — Color-magnitude diagram of galaxies with secure 0.7 < 
z < 0.9 from t he DEEP2 surve y, using the photometric measurements 
presented in|Coil et al.|(|2004|. B, R, and I magnitudes refer to the 
observed pass bands (and. apparent color) rather than the rest frame. 
Grey shading represents the square root of the number of objects 
in each bin. A clear bimodality is apparent, with the passive “red 
sequence” at B-R > 2 and the star-forming “blue cloud” at B-R < 2. 
Galaxies studied in this paper populate the blue cloud, with blue 
colors and approximately representative luminosities relative to the 
overall star forming population. 
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the larger SDSS sample. It is also apparent in Figure [4] 
that star forming galaxies at z ~ 0.8 lie along the same 
locus as local galaxies in terms of these line ratios, with no 
significant offset and comparable scatter. 

3. PHYSICAL PROPERTIES 
3.1. Nebular extinction and reddening 


2.3. Sample bias 

We have been careful to construct a sample without im¬ 
posing any explicit restriction on [O ill] A4363 flux. Fig¬ 
ure [I illustrates that requiring a detection of [O III] A4363 
would induce a bias toward artificially high R[ 0 m] and 
hence higher derived temperatures and lower metallicities. 
Selecting on the basis of strong-line ([O m]AA4959,5007) 
flux and spectral sensitivity simultaneously mitigates this 
bias and ensures good constraints on T e and metallicity, 
even for cases with very weak [O ill]A4363 emission. To 
check whether the adopted sample selection may introduce 
other biases, we plot strong line ratios of [O iii]/H/ 3 versus 
[Ne ill] / [O II] in Figure |4j The T e subsample has higher 
average [O iii]/H/ 3 compared to the parent sample, as one 
might expect given the selection based on [O ill] flux. How¬ 
ever there appears to be no systematic bias in the sense that 
the T e subsample has [O III] /H/3 ratios which are normal 
considering their other properties (e.g., [Ne ra]/[0 Ii] and 
other emission line ratios). Likewise the local comparison 
sample is selected on the b asis of str ong emissi on lines with¬ 
out regard to [O m]A4363 ( |Izotov et a.1. [2006 ), and the line 
ratios indicate no significant systematic offset compared to 


Accurate correction for reddening is essential for the pur- 
poses of this s tudy. We parameterize the reddening as a 


Cardelli et al. (1989) extinction curve with Ry = 4.05, and 
determine the extinction and its uncertainty from a simul¬ 
taneous fit to the Balmer lines H/3, Hy, and H(5. Different 
Balmer line ratios generally give consistent results, agree¬ 
ing within the formal ltr uncertainty in 62% of the sample. 
We do not use lines of higher order than Hd because they 
suffer from non-negligible contamination (by [Ne ill] A3968 
and Ca H in the case of He, and He I A3889 in the case 
of HO and increased sensitivity to stellar Balmer absorp¬ 
tion. Balmer decrements and best-fit extinction A(V) for 
the z ~ 0.8 sample are shown in Figure [5j All emission 
lines are corrected for the best fit extinction, and its un¬ 
certainty is propagated throughout the following analysis. 
Reddening-correcting emission line fluxes are given in Ta¬ 
ble [l] along with best fit extinctions for each galaxy in the 
T e sample. 


Balmer line fluxes provide tight constraints on the color ex¬ 
cess E(B—V) = A(V)/R v for all galaxies in the T e sample. 
However, we cannot meaningfully constrain Ry due to the 
limited rest-frame wavelength range of the DEEP2 spectra. 
Consequently the total extinction A(V) is uncertain, but 














































































Direct Metallicity Measurements at z = 0.8 


5 


TABLE 1 

De-reddened emission line fluxes 


ID 

z 

[O hi] A4959 

H/3 

[O hi] A4363 

[Ne hi] A3869 [ 

O n]AA3727,372£ 

' KPi snr(h py 

A(V) 

1 

0.771 

2.11±0.05 

1.00 

0.092±0.015 

0.528T0.040 

1.48T0.11 

1.30±0.06 

44 

3.01T0.28 

2 

0.819 

1.42±0.04 

1.00 

0.081T0.014 

0.340T0.027 

1.82±0.13 

1.31T0.04 

44 

1.66T0.26 

3 

0.746 

2.07T0.02 

1.00 

0.115F0.006 

0.540T0.012 

0.96T0.02 

1.25T0.03 

114 

1.65T0.08 

4 

0.747 

1.05T0.02 

1.00 

0.018F0.009 

0.229T0.010 

2.65±0.09 

1.10T0.02 

104 

2.58T0.12 

5 

0.762 

1.56±0.03 

1.00 

0.043T0.010 

0.358T0.019 

2.70T0.13 

1.34±0.02 

73 

1.46T0.18 

6 

0.780 

0.93F0.02 

1.00 

0.025F0.005 

0.219F0.010 

2.37T0.08 

1.44±0.03 

83 

2.72±0.12 

7 

0.771 

2.08T0.03 

1.00 

0.067T0.008 

0.483T0.017 

1.66±0.06 

1.25T0.02 

93 

1.60T0.12 

8 

0.766 

1.50T0.05 

1.00 

0.028T0.013 

0.320T0.032 

2.54±0.20 

1.54±0.04 

47 

1.44±0.29 

9 

0.762 

1.28T0.02 

1.00 

0.032±0.011 

0.335F0.021 

2.67T0.15 

1.37F0.03 

78 

2.37F0.22 

10 

0.841 

1.65T0.04 

1.00 

0.063T0.014 

0.415T0.027 

1.90T0.12 

1.48T0.04 

56 

1.57T0.23 

11 

0.767 

2.01T0.04 

1.00 

0.026F0.019 

0.469F0.025 

1.85±0.09 

1.33T0.04 

71 

1.99F0.18 

12 

0.794 

1.74±0.03 

1.00 

0.051F0.009 

0.379T0.018 

1.79T0.08 

1.34±0.03 

78 

1.52±0.16 

13 

0.798 

1.91T0.05 

1.00 

0.095T0.014 

0.434±0.031 

1.66T0.11 

1.40T0.05 

44 

1.82±0.25 

14 

0.842 

1.75T0.04 

1.00 

0.090T0.011 

0.421±0.026 

1.38±0.08 

1.45T0.06 

53 

0.85F0.21 

15 

0.796 

0.84±0.01 

1.00 

0.007F0.005 

0.160T0.009 

2.15±0.07 

1.03F0.01 

139 

2.24±0.12 

16 

0.748 

1.80T0.04 

1.00 

0.151T0.019 

0.455T0.025 

1.53±0.08 

1.38T0.05 

62 

1.61T0.18 

17 

0.738 

1.53T0.04 

1.00 

0.092±0.012 

0.520T0.027 

1.66±0.08 

1.39T0.04 

54 

2.55T0.17 

18 

0.793 

1.65F0.04 

1.00 

0.064±0.016 

0.365T0.032 

2.06±0.15 

1.35T0.03 

52 

1.16T0.26 

19 

0.793 

1.85T0.01 

1.00 

0.073T0.005 

0.435T0.010 

1.93±0.04 

1.42±0.02 

169 

2.18T0.08 

20 

0.755 

1.82±0.03 

1.00 

0.119F0.010 

0.434±0.019 

0.93±0.04 

1.24±0.05 

87 

1.94±0.15 

21 

0.788 

1.63T0.04 

1.00 

0.052±0.015 

0.426T0.034 

1.74±0.12 

1.23T0.05 

54 

2.11±0.25 

22 

0.856 

2.17T0.05 

1.00 

0.064±0.012 

0.330T0.020 

1.20±0.07 

1.25T0.04 

58 

0.40F0.21 

23 

0.732 

1.44±0.02 

1.00 

0.044±0.010 

0.407F0.024 

2.56±0.13 

1.22±0.03 

85 

3.88F0.19 

24 

0.802 

1.44±0.05 

1.00 

0.011T0.009 

0.338T0.028 

2.08±0.17 

1.35T0.03 

33 

1.39T0.29 

25 

0.766 

2.01±0.06 

1.00 

0.058T0.013 

0.408T0.034 

1.69T0.13 

1.35T0.04 

51 

0.94±0.28 

26 

0.721 

1.78T0.03 

1.00 

0.075F0.010 

0.407F0.019 

1.28±0.06 

1.19T0.03 

78 

0.96F0.17 

27 

0.774 

1.64±0.04 

1.00 

0.070T0.011 

0.353T0.026 

2.23±0.14 

1.30T0.03 

42 

2.27T0.21 

28 

0.731 

1.99T0.02 

1.00 

0.063T0.004 

0.492±0.012 

1.41±0.03 

1.25F0.02 

133 

1.34±0.08 

29 

0.751 

1.70T0.03 

1.00 

0.060T0.013 

0.363T0.025 

1.93±0.12 

1.38T0.04 

78 

2.08T0.24 

30 

0.833 

1.39T0.02 

1.00 

0.023T0.009 

0.361T0.018 

2.39T0.11 

1.26T0.02 

84 

1.65T0.17 

31 

0.749 

1.98T0.04 

1.00 

0.056F0.017 

0.442±0.025 

1.95±0.10 

1.44±0.04 

63 

2.74±0.19 

32 

0.751 

1.85T0.04 

1.00 

0.063F0.013 

0.418T0.029 

1.73T0.12 

1.32±0.05 

69 

2.73T0.25 


1 Signal to noise ratio of H/J emission. 


this does not affect our results since we are concerned only 
with line ratios. Fortunately all de-reddened emission line 
ratios used in this work are insensitive to Ry (Figure [6]) and 
are therefore robust. Future observations of Ha or Paschen 
series lines could be used to measure Ry and provide accu¬ 
rate de-reddened emission line luminosities. 

Since many of the line ratios of interest are sensitive to red¬ 
dening corrections (such as [O m]/[0 n]), we have carefully 
considered possible associated systematic errors. One po¬ 
tential source of error is the correction for stellar Balmer 
absorption, which corresponds to an average increase of 1% 
in Hy/H/? and 3% in H5/H/3 for our sample. This is small 
compared to the total uncertainty in the line ratios (Fig¬ 
ure pi) and does not introduce significant systematic errors. 
Even for an extreme case where the true stellar absorp¬ 
tion has been underestimated by a factor of 2, the ratio of 
[O hi]A5007/[0 ii]AA 3727,3729 would be within 0.02 dex 
of the true value (with smaller corrections for other line 
ratios). We also examined the effect of differential atmo¬ 
spheric refraction which may cause increasing slit lo sses a t 
bl uer wavelengths. We use the prescriptions of Filip penko 
(1982) to calculate slit loss as a function of wavelength for 
the median seeing, airmass, and slit position angle of the 
DEIMOS observations. The expected effect is that the total 
extinction is overestimated by ~0.01 dex, with [O 11 ] and 
[Ne ill] emission line fluxes underestimated by only < 0.01 
dex compared to the other emission lines used in this work. 
In summary, systematic errors in the de-reddened line ra¬ 
tios are expected to be <0.02 dex. Despite this small un¬ 
certainty we strive to use reddening-independent line ratios 


(i.e., with close wavelength spacing such as in Figure [4| 
wherever possible. 

3.2. Electron temperature and density 

Nebular electron temperatures and densities are derived us¬ 
ing the IRAF nebular.temdens package. We first esti¬ 
mate temperatures T e from the R[o m i ratio assuming an 
electron density n e = 100 cm -3 , and then calculate n e from 
the [O n]AA3727,3729 doublet ratio using the derived T e . 
This process is iterated and results in no significant change 
compared to uncertainty in the flux ratios. Importantly, 
we find values of n e < 400 cm -3 and T e < 2 x 10 4 K in all 
cases, as shown in Figure [ 7 ] In this regime T e is insensitive 
to n e (<50 K variation). Density is weakly sensitive to T e 
in this regime and we account for this in the uncertainty of 
n e , although the effect on derived abundances is minimal 
(<0.01 dex). The T e values we derive differ by <3 00 K from 
the analytic method used by Izotov et al. (2006), resulting 
in <0.01 dex difference in the derived oxygen abundance. 
Figure [ 7 ] shows n 0 and T e of our sample and the local com¬ 
parison sample. Notably, we find no evidence of evolution 
in these physical properties for galaxies with similar ratios 
of strong emission lines. 

We are unable to measure the singly ionized oxygen temper¬ 
ature from DEEP2 spectra. Instead we estimate T e ([0 11 ]) 
from measu re ments of T e ([0 III]) using the same method as 
Izotov et al. (2006) in order to facilitate direct comparison 
of the results. We measure a scatter about the best-fit lin¬ 
ear T e ([0 n])-T e ([0 III]) relation of cr(T e ([0 11 ])) = 1030 K 
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Fig. 4.— Strong emission line ratios at 2 : ~ 0.8 compared to local 
galaxies. Grey shading shows the populati on of SPSS galaxie s which 
are classified as star forming according to |Kauffmann et al.| (|2003|). 
The increased scatter and general trend at [U in]/tip ^ 1 may be due 
in part to a mix of star formation and AGN excitation in these galax¬ 
ies; the ionizing source is increasingly ambiguous at lower [O ill]/H/5. 
Our 2 : ~ 0.8 sample is consistent with the local star-forming locus, ly¬ 
ing exclusively at relatively high ratios of [O iii]/H/ 3 and [Ne ill] / [O 11 ]. 
The T e subsample has even higher line ratios on average but likewise 
shows no offset from the locus of galaxies at 2 : ~ 0 , nor from the 
broader population at 2 : ~ 0.8. Broad-line AGN in the parent z ~ 0.8 
sample lie near the locus of star formation with a considerably larger 
scatter. 


from the data presented by Pilyugin et al. (2010). While 
this overestimates the true variation in T e ([0 Ilj) at fixed 
T e ([0 ill]), we include a conservative uncertainty of 10 3 K 
in our estimates of T e ([0 Ii]). This is added in quadrature 
with uncertainty arising from measurement error. We cau¬ 
tion that various prescriptions for the T e ([0 n])-T e ([0 ill]) 
relation exist in the literature, and derived metallicities are 
moderately sensitive to the adopted relati on. For example, 
the measurements of Pilyugin et al. (2010) suggest values of 
T e ([0 n]) which are ~10 3 K lower on average than those we 
adopt; this would result in a ~0.02 dex increase in oxygen 
abundances for our sample. 


3.3. Metallicity 


Fig. 5.— Balmer emission line ratios and best-fit extinction A(V). 
Multiple Balmer lines generally give consistent results. The solid col¬ 
ored line shows expected Balmer decrements as a function of A(V) 
for our adopted extinction curve with Ry = 4.05. Different values 
of Ry result in an indistinguishable curve (given the measurement 
uncertainties), with A(V) varying in proportion to Ry in such a way 
that the color excess E(B— V) = A(V)/Ry is robust. 


[Ne III] HS Hy Hp [O III] Ha 



Ionic abundances are derived from the density, tempera¬ 
ture, and de-reddened line ratios of each galaxy in our sam¬ 
ple. Fbr_consistency_we adopt the analytical formulae used 
by Izotov et al. (2006) to derive abundances of 0 + /H + , 
0 2+ /H + , and Ne^ + /H + . We calculate the total oxygen 
abundance as 

O _ 0 + 0 2+ 

” H+ 


H 


H+ 


No correction is made for higher ions, which are expected 
to account for <1% in all but three galaxies i n our sample 
based on their 0 + /0 2+ ratios ( Izotov et al.|2006 ). We have 
also calculated abundances using the IRAF NEBULAR.IONIC 
package, and we find that IRAF produces lower metallici¬ 
ties by AlogO/H ~ 0.03 dex. For neon, we convert from 


Fig . 6. — Emission line flux ratios compared to various |Cardelli| 
|et al. |1989]) extinction curves, demonstrating a degeneracy in Ry. 
All values are relative to a fiducial curve obtained by fitting the sam¬ 
ple mean Balmer emission line ratios with Ry = 4.05. Data points 
show sample mean Balmer fluxes relative to H/3 (assuming intrin¬ 
sic Balmer decrements from case B recombination: H 7 /H f3 = 0.47, 
H<5/H/3 = 0.26) and mean [Ne ill] relative to [O ill] (assuming intrin¬ 
sic [Ne hi]/[O in] = 0.077, the mean de-reddened value from the local 
comparison sample). We additionally fit the Balmer line ratios us¬ 
ing extinction curves with various Ry (dashed lines). We note that 
Rv = 3.1 a nd 4.05 are typically assumed for the Milky Way ( Car dellil 
|et al. 1 1989|) and for starburst galaxies (e.g., JCalzetti et al.|2O0"o|, re¬ 
spectively. The data are in excellent agreement and cannot constrain 
Ry due to the short wavelength range probed, although different Ry 
curves diverge rapidly at longer and shorter wavelengths. Future mea¬ 
surements of Ha or other features beyond the present wavelength 
range could therefore be used to determine Ry. 
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Fig. 7.— Nebular electron density n e and temperature T e as a func¬ 
tion of [O iii]/H/3. There is no evolution in these properties: galaxies 
at 2 : ~ 0.8 have densities and temperatures which are typical of local 
galaxies with similar strong line ratios. Densities are derived from the 
[S n]AA6717,6731 doublet for the z = 0 comparison sample and from 
the [O n] AA3727,3729 doublet for the 2 : ~ 0.8 sample. Temperatures 
are derived from the R[q hi] ratio for both cases. 


Ne 2+ /H + to total Ne/H using the same method as 


Izotov et 


al. (2006). O/H and Ne/O abundances for our sample are 


shown in Figure pi Relative abundances of neon and oxygen 
are measured with higher precision than O/H and Ne/H, 
since Ne/O is less sensitive to the electron temperature. 


3.4. Uncertainty 

We have been careful to assess and mitigate potential 
sources of systematic uncertainty in deriving physical quan¬ 
tities. Ne/O abundance ratio measurements (Figure[8]) pro¬ 
vide an empirical estimate of the degree of both random 
and systematic uncertainty. Neon and oxygen are predom¬ 
inantly generated by the same nucleosynthetic processes, 



12+log(0/H) 


Fig. 8 .— Neon and oxygen abundances. The Ne/O abundance 
ratio at 2 : ~ 0.8 is consistent with local galaxies; the median value is 
log Ne/O = —0.72 zb 0.01 for both samples. For reference, estimates 


of the solar value are typically logNe/O^ = —0.8 to —0.65 (Asplund 
|et al.|2009| |. 


and consequently Ne/O abundance shows little variation 
in H II regions and planetary nebulae spanning a wide 
range of metallicity and other physical pr operties (e.g., Izo-| 


tov et ah] 2011 Leisy & Dennefeld 2006). The modest in¬ 


crease in iSle/O at higher O/H, amounting to a variation of 
A log Ne/O ~ 0.1 dex across the s amples considered here, 
is discussed further in Section [5721 


Both the DEEP2 and local comparison samples have a me¬ 
dian abundance ratio of log Ne/O = —0.72 ± 0.01. Such 
excellent agreement indicates that any systematic errors af¬ 
fecting the comparison between these two samples are lim¬ 
ited to < 0.01 dex. In particular, Ne/O is highly sensitive 
to estimates of reddening and differential refraction due to 
the relatively large wavelength separation of [Ne III] A3869 
and [O ill] AA4959,5007. We infer that there is minimal un¬ 
certainty arising from these and other effects, or else that 
both samples are affected by similar amounts. Most impor¬ 
tantly this confirms that direct comparison of the two sam¬ 
ples yields reliable results. We caution that studies employ¬ 
ing different methods are not necessarily comparable (e.g., 
abundances derived from IRAF’s NEBULAR. IONIC task dif¬ 
fer systematically by ~ 0.03 dex as discussed above). 
Additionally, 19 of the 32 galaxies in the DEEP2 sample are 
compatible with log Ne/O = —0.72 at the la level, consis¬ 
tent with expectations for an approximately constant Ne/O 
abundance ratio. Allowing for a trend of increasing Ne/O 
with O/H, we find that 24/32 galaxies are consistent (ler) 
with the best-fit linear relation, also in agreement with sta¬ 
tistical expectations. This sanity check confirms that un¬ 
certainties adopted for our sample are similar to the true 
measurement error. 


4. STRONG-LINE ABUNDANCE DIAGNOSTICS 
































































































We construct a set of diagnostic relations between oxy¬ 
gen abundance derived in Section [3j expressed as 12 + 
logO/H, and the following strong emission line ratios: 
[Ne m]/[0 n], [O m]/[0 n], [O ra]/H/3, [O ii]/H /3, and 
R 23 = 1° n ] + [° in]AA4959,j007 ^ gj nce wg fi nc [ nQ evo ] u tion in 

these relations (Section [5~3] ), we derive them from the local 
comparison sample. Independent calibrations of the 2 : ~ 0.8 
sample are fully consistent but with larger uncertainty. A 
calibration of the combined data sets also gives consistent 
results and does not substantially improve the precision. 


We anchor the metallicity calibrations to fits of 
[Ne m]/[0 11 ] and [O m]/[0 11 ] as a function of metallic¬ 
ity. These two relations are examined first because they 
are known to be monotonic, at least for local galaxies (e.g., 


Maiolino et al. 20081. We fit the data using a functional 
form 


logR = c 0 + cpr, ( 1 ) 

where R is the line ratio and x = 12 + log O/H. The result¬ 
ing fits have reduced xt values larger than one, which we 
attribute to intrinsic scatter in these relations. Assuming 
that the estimated uncertainties are accurate, we calculate 
the intrinsic scatter fr int by solving for the expectation that 


.2 = JL V AlogR = 1 
W n - 2 ■ -2 ’ 


°7nt + < 


( 2 ) 


where <T mea s is the measurement uncertainty and N is the 
number of degrees of freedom in the fit. 

The other line ratios exhibit significant higher-order trends 
and we therefore adopt a second-order polynomial relation 


log R = c 0 + ci a: ± c 2 a: 2 . (3) 

However, fitting logR to logO/H directly generally fails to 
capture the non-linear behavior, likely because of limited 
dynamic range and relatively large uncertainty in metal¬ 
licity. Including higher order polynomial terms only ex¬ 
acerbates this problem. Therefore, we obtain metallicity 
calibrations by fitting the line ratios to 

log R = a 0 +cii y + a 2 y 2 (4) 


TABLE 2 

Strong-line abundance diagnostic coefficients 



R 

CO 

Cl 

C2 

(Tint 1 (dex) 


Ne iii]/[0 11 ] 

16.8974 

-2.1588 


0.22 


O hi] / [O 11 ] 

17.9828 

-2.1552 


0.23 


Ne iii]/[0 hi] 

-1.0854 

-0.0036 


0.04 


O hi] /H/3 

-88.4378 

22.7529 

-1.4501 

0.10 


O ill/H/3 

-154.9571 

36.9128 

-2.1921 

0.15 

R 23 

-54.1003 

13.9083 

-0.8782 

0.06 

02Ne3 

-53.5153 

13.9083 

-0.8782 

0.08 


1 Intrinsic scatter in logR at fixed logO/H. 


represents the mean [O m]AA4959,5007/[Ne ill] ratio. In 
both of these cases we calculate the intrinsic scatter from 
Equation [2j While the values in Table [2] are derived from 
the local comparison sample, results from the z ~ 0.8 T e 
sample are consistent with the same coefficients and scatter. 
Higher order polynomial terms do not significantly improve 
the fit quality or derived scatter compared to the adopted 
results from Equations [I] and [3] We caution that these 
diagnostics are valid only within the range of abundances 
and line ratios probed by the analyzed data. 


5. RESULTS 


5.1. No evolution in temperature and density 

Figure [7] demonstrates that T e and n e do not evolve signif¬ 
icantly^t>etween z = 0 —> 0.8 at fixed emission line ra¬ 
tios. We quantify this with linear fits to the data and 
measure median offsets AlogT e = —0.013 ±0.014 and 
Alogn e = —0.05 ± 0.07 in the DEEP2 sample at z 0.8, 
relative to the local comparison sample at fixed [O ill]/H/?. 
Similarly there are no significant differences in density or 
temperature relative to any other available line ratio, nor 
to the derived metallicity. [O iii]/H/ 3 is used here in or¬ 
der to address the possibility that systematically higher n e 
may cause elevated ratios of [O iii]/H/ 3 and [N iij/Ha ob¬ 
served a t high redshift (e.g., Kewley et al.||2013 Shirazi et 
akf2014|). We find no evidence tor such an effect at 2 : = 0.8. 


with y = log [Ne m]/[0 11 ]. (Using y = log [O m]/[0 11 ] 
gives consistent results, but is subject to larger uncer¬ 
tainty arising from reddening corrections.) This provides 
improved fits since these strong emission line ratios are 
measured with much better precision than oxygen abun¬ 
dance. Combining these fits with the calibrations derived 
from Equation gives a relation in the form of Equation [3] 
Intrinsic scatter is derived from Equation [2] in the same way 
as the other diagnostics. 


Best-fit relations for each strong line abundance diagnostic 
are shown in Figure [9] Table [2] lists the corresponding fit 
coefficients and intrinsic scatter. We also include the re¬ 
lation between metallicity and [Ne m]/[0 ill] obtained by 
combining the [Ne III] / [O 11 ] and [O m]/[0 11 ] calibrations 
(rather than a di rect fit), as well as the 02Ne3 calibration 
introduced by Perez-Montero et al. (2007): 


02Ne3 = 


[O 11] + 17.06[Ne hi] 

S5 


R 23 


H /3 
RS’ 


which we obtain from multiplying the R 23 calibration by 
the Balmer line ratio (H<5/H/3 = 0.260). The value 17.06 


5.2. No evolution in Ne/O abundance 


The relative abundances of neon and oxygen are show n in 
Figure [ 8 ] and have been discussed briefly in Section |3.4| 
We now consider the positive correlation between Ne/O 
and O/H and its implications for high redshift galaxies. 


Large samples of local galaxies indicate a slope AtogCVH = 
0.097±0.015 (Equation 30 of|Izotov et al.[2006^, consistent 
with the data at 2 : ~ 0.8. We find no convincing evidence 
for evolution in either the slope or normalization of this 
relation: our T e sample is offset by a median A log Ne/O = 
0.011 ± 0.008 dex relative to the linear fit at z = 0. The 
cause of i ncreasing Ne/O wi t h O/ H is not entirely clear, 
however. Izotov et al.| (2006 2011) attribute this relation 
to depletion of oxygen onto dust grains, implying ^ 20 % 
of interstellar oxygen in the solid state for the most metal- 
rich galaxies in their sample. Alternatively, nucleosynthetic 
processes such as a-capture onto oxygen or conversion of 
14 N—l 22 Ne have been proposed to explain t he variation in 


Ne/ O (as discussed in, e.g., section 4.5 of Leisy & Dennefeld 
2006| . 
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Fig. 9.— Strong emission line ratios as a function of metallicity. Solid lines show best fit relations derived for the z ~ 0 sample, and dashed 
lines show the intrinsic scatter in each relation. Data at z ~ 0.8 are consistent with the z ~ 0 comparison sample to within the I —a measurement 
precision of AlogO/H < 0.03 dex. 


Theoretical supernova yields can provide some insight into 
the effects of nucleosynthesis and dust depletion. For this 


purpo se we integrated the yield tables of Nomoto et al. 
(2013;) for core-collapse supernovae with a Salpeter i | 19~)~>p 


initial mass functio n truncated at 50 M^ l (The IMF- 
integrated tables of |Kobayashi et aT7||2006| give similar re¬ 
sults). This gives an expected logNe/O = —0.71 for pro¬ 
genitor stars with l/5th solar metallicity, increasing to 
logNe/O = —0.67 for 2/5th solar metallicity due to roughly 
equal effects of higher neon and lower oxygen yields. We 
note that the 22 Ne production and hence 1? N—» 22 Ne reac¬ 
tion is negligible for these models. These predictions are 
in excellent agreement with the slope in Figure [8j indicat¬ 
ing that metallicity-dependent supernova yields can explain 
the trend. The small offset in predicted yields compared to 
the data at fixed metallicity is in the opposite direction 
expected from depletion of oxygen onto dust, hence the 
models suggest that depletion is not important. However 
we caution that this interpretation is subject to significant 
uncertainty in the theoretical yields. 

We can independently test whether the dust depletion 
hyp othes is is plausible by using nebular extinction (Sec¬ 


tion 


3.1) as a proxy for dust content. Despite low mea¬ 


surement uncertainty there is no significant correlation be¬ 
tween extinction and Ne/O in either the T e or local com¬ 
parison samples, in contrast to the >3u trend between 
O/H and Ne/O (Figure [8]). Therefore Ne/O appears to 


be more fundamentally related to O/H than to dust con¬ 
tent. This supports nucleosynthetic processes rather than 
dust depletion as the cause of variation in Ne/O, although 
we caution that knowledge of the dust-to-gas mass fractions 
would be needed for robust conclusions. The trend of in¬ 


creasing Ne/Ar with O/H (as pointed out by, e.g., Izotov 


et al. 2006) further suggests that nucleosynthetic yields of 
JNe and/or Ar vary with overall metallicity since neither of 
these noble gases is affected by dust depletion. 


Zeimann et al. (2015) have recently presented evidence for 
a ~0.2 dex increase in [Ne III] /[O ill] flux ratios at z ~ 2 
compared to local galaxies, although its origin is unclear. 
In contrast, we find that line ratios at £ ~ 0.8 are consis¬ 
tent with the locus of local star-forming galaxies (Figures[4j 
[9|. The T e and local comparison samples both have mean 
and median reddening-corrected log[Ne Hl]/[0 ill] = —1.11 
with 0.05 dex scatter (not corrected for measurement uncer¬ 
tainty). The d ifference in s ample mean is only 0.006±0.011 
dex. The Zeimann et al. (20151 results therefore may in¬ 
dicate very rapid evolution from z zz 1 —> 2, possibly from 
differences in nucleosynthetic yields, although this is diffi¬ 
cult to reconcile with the lack of evolution at z < 1. Further 
measurements at z > 2 are needed to confirm whether neon 
emission is indeed enhanced and to determine the physical 
cause. 
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5.3. No evolution in strong-line abundance diagnostics 

We now examine the quantitative relation between gas- 
phase metallicity and strong emission line ratios as a func¬ 
tion of redshift. [O m]/[0 il] and [Ne m]/[0 n] ratios 
are most easily compared since they vary monotonically 
with metallicity. We calculate expected metallicity for each 
galaxy in the T e sample based on measured line ratios and 
the best-fit relations derived from local galaxies (Table [2]), 
and compare the expected values with those derived in Sec- 
tion[3j We note that measurement uncertainty (particularly 
in [(J[il] A4363) results in a skewed metallicity distribution, 
such that the sample mean is biased toward higher val¬ 
ues while the weighted mean is biased toward lower val¬ 
ues. The median is robust, however, and we therefore 
use sample medians for comparison. The median offset at 
z ~ 0.8 is AlogO/H = 0.01 ± 0.03 for [Ne iii]/[ 0 n] and 
AlogO/H = —0.02±0.03 for [O m]/[0 II] . Other line ratios 
show similarly null evolution of typically —0.01 ± 0.03 dex. 
Averaging these measurements does not decrease the uncer¬ 
tainty, which is statistically limited by the sample size. We 
conclude that there is no evidence for evolution in metallic¬ 
ity at fixed line ratios from z = 0 —> 0.8. Local calibrations 
of the strong line abundance diagnostics considered here 
therefore appear to be valid to at least z ~ 1, within the 
measurement precision of 0.03 dex in logO/H. 

5.4. No evolution in a-element emission line ratios 

We show several line ratio diagnostic diagrams in Figure [TO} 
Qualitatively it is clear that galaxies in the T e sample at 
z — 0.8 follow the same locus as local star-forming galax¬ 
ies (as is also true for the parent DEEP2 sample, shown 
in Figure |4]). This holds for de-reddened flux ratios of 
emission lines which are widely separated in wavelength 
(e.g., R 2 3 and 0 32 = [O m]/[0 n]) as well as those for 
which reddening corrections are negligible ([O in]/H/3 and 
[Ne III]/ [O II]). Essentially, Figure [lO] demonstrates that 
star-forming galaxies form a tight locus in the parameter 
space defined by their intrinsic (de-reddened) [O n], [O ill], 
[Ne hi], and Balrner emission line fluxes, and that this locus 
does not evolve with redshift at z < 1. As a quantitative 
example, the [O iii]/H/3 versus [Ne iii]/[ 0 n] locus has a 
dispersion of only 0.04 dex in the T e sample (and 0.03 dex 
in the local comparison sample), with an offset of 0.01±0.01 
dex in [O hi]/H/ 3 at z ~ 0.08. 

The null evolution in line ratios is not surprising given 
the lack of any apparent evolution discussed previously in 
this section. However it is worth emphasizing given that 
very strong evolution is observed in the parameter space of 
[N n], [O i ll], and Balrner lines (with ^0.2-0.4 dex offsets 
at z 2.3; ISteidel et al.||2014| | Shapley et al.|[20l5] |Kewley| 
et al. |20 13 1 . These same galaxies show no evolution from 
the locus "defined by [O il], [O ill], and Balrner lines con¬ 
firming that the non-evolution seen in Figure [lO] holds to at 
least z > 2. Evolution in [N n]/Ha ratios at nigh redshift 
is therefore likel y due to systematically higher N/O ratio s 
at high redshift ( Shapley et al.|[2015[ Masters et al. 2014), 
arising from the complicated nucleosynthetic production of 
nitrogen. In contrast, a-capture elements such as oxygen 
and neon originate almost entirely from core-collapse super¬ 
novae and their production is thought to accurately trace 
the integrated star formation history with little scatter. 
Several authors have discussed how the average [O ill]/H/3 
ratio of star forming galaxies increases with redshift. Possi¬ 


ble explanations for this behavior include sample selection 
bias and evolution in various physical properties of H n 
regions (su ch as ionization parameter, density, and metal¬ 
licity; e.g., Juneau et al. 2014 Kewley et al. 2013). While 
all of these effects are likely present, our results suggest a 
simple scenario in which galaxies at all redshifts populate a 
constant locus of a-element and hydrogen line ratios, with 
evolution in the density of galaxies along the locus (i.e., 
toward higher [O III] /H/3 ratios at higher redshifts). The 
locus position is governed largely by metallicity as shown 
in Figure no] (Other properties such as ionization param¬ 
eter are of course correlated with metallicity.) Therefore, 
evolution of the galaxy population simply reflects the over¬ 
all increase in metallicity (O/H) over cosmic time, which 
has been quantified in numero us studies of evolution in the 
mass-meta llicity relation (e.g., Maiolino et al. 2008 |Zahid 


et al. 
evolu 


2013 and many others). These studies indicate an 
0.1-0.2 dex in metallicity at fixed stellar mass 


ion of' 

from z ~ 0 —> 0.8. This amounts to a change in [O ill]/H/3 
ratios by >0.2 dex for typical galaxies. We note that in con¬ 
trast to the diagrams in Figure 10 loci of [O ill]/H/3 as a 
function of [N ii]/ Hq o r stell ar mass (the Mass-Excitation 
diagram; Juneau et al. 2014) are not constant because of 
evolution m 1N/U abundances and stellar mass at fixed O/H. 

6. DISCUSSION 

6.1. Metallicity inferences at high redshift 

The primary purpose of this paper is to present a set of 
strong-line metallicity diagnostics that can be accurately 
used at high redshifts. Ultimately the diagnostics given in 
Table [2] and Figure [9] are calibrated from local data since 
we find that data at z ~ 0.8 give fully consistent results 
but with larger uncertainty. While this study is limited to 
z < 0.8, galaxies at z ~ 2.3 show no o ffset in the l ine ratios 
considered h ere (Shapley et al.||2015 but see also Zeimann 
et al. 2015 regarding [JNe III] ). We therefore consider it 
likely that these diagnostics are accurate up to z > 2 and 
perhaps even at all redshifts, although this has yet to be 
confirmed with direct T e -based measurements. We wish to 
emphasize that the diagnostics in Table [2] are only valid 
for the range of line ratios and metallicities probed by this 
work (12 + logO/H = 7.8-8.4). Figure [TO suggests that 
mild extrapolation may yield reasonable results but we do 
not endorse this. 

A broader consequence of the non-evolution in strongdine 
metallicity diagnostics (those in Table [2] and Figure [9] at 
least) is that any local calibration can be used to infer self- 
consistent metallicity evolution from high redshift data. A 
variety of such calibrations exist in the literature based 
on different methods including the “direct” T e method, 
photoionization models, and recombination lines. Differ¬ 
ent methods are well known to give discrepant results (e.g., 
Blanc et al. 20151 and hence the true absolute metallic¬ 
ity’scaleWeinams under debate. Even the calibrations pre¬ 
sented here are subject to systematic variation by up to 
0.05 dex in metallicity depending on the adopted atomic pa¬ 
rameters and T e ([0 Il])-T e ([0 ill]) relation. However, the 
T e -based analysis in this work demonstrates clearly that 
metallicity does not evolve at fixed strong line ratios, at 
least for the hydrogen and a-element lines considered here. 

We advocate using multiple diagnostics simultaneously to 
infer metallicity from strong-line methods. It is clear from 
Figures |9| and |10| that the usefulness of various diagnostics 
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Fig. 10.— Strong emission line ratios and best-fit metallicity relations (from Table |2|. All emission line ratios are corrected for nebular 
reddening although this correction is negligible for the left-most panel. Galaxies in the T e sample at z ~ 0.8 (blue points), the local comparison 
sample (red points), and the broader population of star forming galaxies in SDSS (grey shading) all populate the same locus with low scatter 
(~ 0.04 dex RMS). The T e sample is within 0.01 =t 0.01 dex of the locus defin ed by local galaxie s indicating no significant evolution at z < 1. The 
central panel is the “blue” diagnostic diagram with boundaries described by Lamareilleff20lb). We caution that the star forming locus extends 
into the Seyfert 2 region at low metallicities (12 + logO/H < 8.0). 


is a strong function of their position along the locus of line 
ratios (i.e., metallicity). For example, O 32 gives a more pre¬ 
cise constraint on metallicity than R 23 in the regime where 
O 32 > 1 (12 + logO/H < 8.4), while R 23 is the more precise 
indicator at higher metallicities. In any case the precision 
is clearly improved when using both in combination (right¬ 
most panel of Figure [To|. The best approach is of course to 
use all available diagn ostics to in fer metallicity, as was done 
for example in |Belli et al.| ([2013 ) and |Maiolino et al. (2008 
with an excellent illustrative example in Figure 6 of that 
paper). One may also use observed (reddened) flux ratios 
combined with priors such as H 7 /H/? = 0.47 to produce 
a 2 -dimensional prob ability distribution in both reddening 
and metallicity (e.g., Maiolino et al.| 2008 1 , which is more 
robust than adopting a fixed reddening estimate. We note 
that while [Ne m]/[0 in] is insensitive to metallicity, it pro¬ 
vides a valuable constraint and sanity check on the nebular 
reddening (Figure [6b. Knowledge of the intrinsic scatter 
is essential for comDining multiple diagnostics: the scatter 
allows for a straightforward calculation of the likelihood of 
any metallicity and nebular reddening (i.e., the posterior 
probability distribution) from a given set of emission line 
ratios, from which the most likely metallicity and confi¬ 
dence intervals can be computed. It is for this reason that 
we list the intrinsic scatter in line ratios at fixed metallicity 
for each of our calibrations in Table [2j 
It is worth considering whether the precision of strong-line 
metallicity methods can be improved by introducing a sec¬ 
ond parameter. For example, photoionization models sug¬ 
gest that the intrinsic scatter in metallicity calibrations can 
be reduced by fitting fo r the ionization parameter in addi 
tion to metallicity (e.g ., 


Kobulnicky & Kew- 
Ty the idea is to use 


. Pilyugin|200L; 
ley 2004 Blanc et al. 2015|). Typical! 
e.g. R 23 primarily as an indicator of metallicity, with a 
correction for ionization parameter derived from additional 
line ratios such as O 32 . We conducted an empirical test by 
measuring the correlation between metallicity and offsets in 
the direction orthogonal to the best-fit loci shown in Fig¬ 


ure |10| There is a significant trend in the local comparison 
sample, in the sense that 


A logO/H 
A log R 23 


= 1.89 ±0.14 


(5) 


at constant O 32 . The error represents formal uncertainty 
in the best-fit slope and does not reflect the much larger 
sample variance. While this indicates a statistically signifi¬ 
cant secondary parameter, its effect is negligible: applying 
Equation[5]reduces the intrinsic scatter in inferred metallic¬ 
ity by only 3% compared to using the O 32 calibration alone. 
Likewise the /(P,R 23 ) method of Pilyugin (2001) does not 
substantially reduce the scatter. We therefore find no com¬ 
pelling reason at this time to include secondary trends be- 
twe en m etallicity and offsets from the line ratio loci in Fig¬ 
ure [To] when calculating metallicity using the strong-line 
calibrations presented here. 

6.2. Prospects for T e -based metallicities at z > 2 

As the first such work at cosmological distances, our in¬ 
vestigation of strong-line metallicity diagnostics at z ~ 0.8 
provides helpful guidance for future efforts to undertake a 
similar study at higher redshifts. It is imperative to con¬ 
struct a sample which is complete in sensitivity, without re¬ 
gard to emission line detection significance as this induces 
a bias. We note that T e -based metallicities for 2 > 1.5 


galaxies reported in the literature (by ViUar-MartfnetjR 


2004; 

Yuan & Kewley 2009; Christensen et al. 2012; .James 

et al. 

2014 Steidei et al. 

20140 are on average lower than 


expected trorn their strong line ratios, although most in¬ 
dividual measurements are compatible with the z = 0 re¬ 
lations given the scatter. However the existing sample at 
these redshifts is too small for robust conclusions. For our 
sample, selecting only the galaxies with > 3 a detections 
of [O iii]A4363 in Figure [I] would lead us to conclude that 
metallicity calibrations evolve by A logO/H ~ —0.2 dex 
at fixed [O m]/[0 11 ] compared to z = 0. Even requir- 
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ing a 5 a detection would lead to an inferred evolution of 
>0.1 dex. A sensitivity such that uncertainty in the ra¬ 
tio of [O hi]A4363/[O iii]A5007 is a < 0.003 is sufficient 
for a sample with comparable (or lower) metallicity to that 
used here. Our sample of 32 galaxies yields a precision 
of 0.03 dex in the evolution of strong-line metallicity diag¬ 
nostics, and hence even a modest sample of 10-15 galaxies 
would provide a measurement of evolution at the ~0.05 dex 


level. Given the wealth of spectroscopic c 
from ground- and space-based surveys (e 

ata now in hand 

.g., Steidel et al. 

2014) Shapley et al. 2015 Maseda et al. 

2014), it may be 

practical to construct such a sample at z 
pre-selection and moderately deep follow 
requisite sensitivity to [O ill] A4363. 

6.3. Implications for evolution in the 

Our choice of [O iii]/H (3 as the abscissa ii 
tivated by the strong observed redshift ei 
[O Ill]/H /3 or/and [N n]/Ha (the classic 
We have found no evidence for changing 
tions (specifically n e , T e , O/H, and Ne/O 
fixed [O m]/H/3 flux ratios. Furthermore 
from the locus of line ratio diagrams invc 
and other a elements, with ~0.01 dex prec 
We therefore strongly suspect that evolutic 
agram is due to [N n]/Ha and not [O Ill]/E 
ward explanation is that N/O abundance r 
atically higher in high redshift galaxies al 
line ratios (i.e., fixed metallicity, as consic 

£ 2 with careful 
up to obtain the 

BPT diagram 

1 Figure [7] is rno- 
rolution m either 
BPT diagram), 
physical condi- 
in galaxies with 
we find no offset 
living [O iii]/H/3 
ision (Figure 10). 
m in the BPT' di- 
/?. A straightfor- 
atios are system- 
fixed a-element 

iered by Masters 

et al. 2014 Shapley et al. 2015 Steidel eta 


1. 2014|. This ap- 

pears to be a plausible explanation, since relatively nearby 
galaxies with properties similar to high redshift samples 
(such as the “green peas” and “Lyman break analogs”) 
have higher N/O than expected for their O/H metallic- 

ity (e.g., Izotov et al.|[2011 Amorfn et al. 

2010). This can 

result from various effects related to ing 
rates such as metal-poor gaseous inflows 
flows, a large population of Wolf-Rayet st 

n star formation 
metal-rich out¬ 
ers, and previous 

star formation history (e.g., Amorfn et al. 2010 Masters et 

al. 2014] Andrews & Martini 2013 Martin-Manjon et al. 


2008). 


It is not yet known whether the galaxies in our sam¬ 
ple at 2 ~ 0.8 are offset from local galaxies in terms 
of [N Ii]/Ha. Near-IR spectroscopic followup with Keck 
is ongoing to measure [N n], Ha, temperature-sensitive 
[O n]AA7320,7330, and other diagnostic features. Data 
gathered from the followup campaign will provide accurate 
T e -based measurements of N/O and determine whether ni¬ 
trogen abundance variations can plausibly explain system¬ 
atic offsets toward higher [N Ii]/Ha flux ratios as observed 
in high redshift galaxies. 

7. CONCLUSIONS 

We have examined the relation between various strong opti¬ 
cal emission line ratios and direct T e -based measurements 
of metallicity (gas-phase oxygen abundance) in a homo¬ 
geneous sample of star forming galaxies at z = 0.72-0.87 
drawn from the DEEP2 survey. While these strong-line 
diagnostics are the principal method used to infer galaxy 
metallicities beyond the local volume, this work represents 
the first direct calibration at cosmological distances. Our 
results confirm that at least some of the commonly used 
diagnostics are valid up to z < 1 and provide guidance for 
future studies at higher redshifts. 


Our main results are as follows: 


• The relation between metallicity and strong emission 
lines of oxygen, hydrogen, and neon does not evolve 
between z = 0 —> 0.8, to within our measurement pre¬ 
cision of 0.03 dex in O/H (la). The relation between 
Ne/O and O/H abundance is likewise constant: we 
measure AlogNe/O = 0.01 ± 0.01 at fixed O/H at 
z ~ 0.8 relative to z = 0. Therefore, these locally cal¬ 
ibrated strong-line metallicity diagnostics are valid to 
at least z — 0.8, and they may remain accurate even 
at the highest redshifts. 

• We present a set of calibrations between metallicity 
and strong line ratios with the intent of using multi¬ 
ple such diagnostics simultaneously. Our calibrations 
define a locus in the parameter space of [O il], [O ill], 
[Ne in], and Balmer emission lines which is designed 
to closely follow the observed locus of star forming 
galaxies. Via these diagnostics, metallicity can be ac¬ 
curately inferred from the position of a galaxy on var¬ 
ious two-dimensional line ratio diagrams (e.g., O 32 vs. 
R23) with improved precision compared to using a sin¬ 
gle diagnostic such as R23. We include measurements 
of the intrinsic scatter so that multiple diagnostics 
can be combined to calculate a posterior probability 
distribution in metallicity (and nebular reddening), 
given a set of emission line ratios. 

We find evidence for a secondary parameter in the 
sense that metallicity is correlated with a galaxy’s 
offset from the locus. However, including this offset 
as an additional parameter does not substantially re¬ 
duce the scatter in our strong-line calibrations. 


• Star forming galaxies at 2 ~ 0.8 follow the same locus 
of oxygen, hydrogen, and neon emission line ratios as 
at z = 0, with offsets of ~ 0.01 ± 0.01 dex and low 
scatter (<0.04 dex). This is in contrast to the star for¬ 
mation locus of [N n]/Ha versus [O iii]/H/3, which has 
larger scatter and evolves with redshift (by >0.2 dex 
at z = 2). Our results as well as earlier work (jShapley 
et al.|2015 Masters et al.|2014 1 suggest that this evo- 
lution is exclusively in IN III /Ha, possi 


exclusively in [IN II] /Ha, possibly caused by 
N/O abundance variations. Ongoing followup spec¬ 
troscopy is needed to confirm or refute this hypothesis 
within our sample. 


While this work represents an important first step toward 
confirming the validity of strong-line methods for inferring 
metallicity in high redshift galaxies, we are acutely aware of 
the limited range in redshift and metallicity probed by our 
T e sample. The narrow z = 0.72-0.87 is restricted solely by 
the availability of spectra with sufficient sensitivity, spec¬ 
tral resolution, and wavelength coverage. Extending this 
work to higher redshifts will require suitable spectra at 
near-infrared wavelengths. Encouragingly, near-IR spectra 
now exist for large samples of galaxies at z = 1-3.5 thanks 
to dedicated surveys with multiplexed instruments such 
as Keck/MOSFIRE, VLT/KMOS, and HST/WFC3 (e.g., 
Steidel et aH2014]|Kriek et al.|2014| [Wisnioski et al.||2015| 

Hrammer et al.|2012a|). These surveys are not deep enough 

to detect T e -sensitive emission lines, however, and so ad¬ 
ditional followup will be necessary for T e -based studies. It 
should already be practical to select modest sub-samples for 
efficient followup using a selection method similar to ours 
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(see Figure [TJ. requiring strong [O ill] AA4959,5007 lines and 
fa vorable reash ifts for observing [O III] A4363. The work 
of Maseda et al.| (20141 illustrates the potential efficacy of 
this approach. Pre-selection of [O ill] emitters from HST 
grism spectra offers an additional advantage over purely 
ground-based spectroscopy by including redshifts at which 
the strong lines are affected by telluric features (for example 
z ~ 1.65-2.0, where [O iii]A4363 is in the favorable J-band 
window while [O ill] A5007 and H/3 are obscured by telluric 
absorption). 

It is somewhat fortuitous that our calibrations cover only 
12 + logO/H < 8.4, since T e -based results and other meth¬ 
ods become increasingly discrepant at higher metallicity. 
There are practical concerns as well: [O ill] A4363 emission 
is progressively weaker at higher metallicity due to a com¬ 
bination of decreasing H n region temperature and a lower 
0 ++ fraction. At higher metallicities, oxygen abundance 
is dominated by the 0 + ion and the temperature-sensitive 
[O n]AA7320,7330 lines become stronger than ]Q m]A4363 
(at 12+logO/H, > 8.2-8.4; e.g., Izotov et al.|2006j [Andrews' 
& Martini||2013|). Therefore, selecting galaxies with strong 
[U nj emission and precise measurements of T e ([0 Ii]) may 
provide a practical way to extend our study to higher 
metallicities. Additionally the T e -sensitive sulfur li nes 


[S ii]AA4069,4076 and ]S ill] A6312 may be useful (e .g., Peim 
bert & Costero 1969 Perez-Montero fe Diaz] 20031. StackecT 


spectra are an other possible way forward (e.g., Andrews fe 
Martini||2013 ). 


Throughout this work we have been careful to compare data 
at z ~ 0 and z — 0.8 using consistent methods. Our results 
regarding redshift evolution are therefore robust. However, 
we caution that our T e -based methods do not necessar¬ 
ily reflect the absolute metallicity scale. Different assump¬ 


tions for ionization corrections, the T e ([0 Il])-T e ([0 III]) 
relation, and atomic properties can plausibly change the 
derived metallicities by ^0.05 dex. Different methods (e.g., 
based on photoionization modeling or recombination lines) 
suggest even larger differences of ~0.2 dex. Discrepancies 
between these methods can plausibly be reconciled by spa¬ 
tial variations in temperature or by a non-Maxwell ian elec¬ 
tron energy distribution (e.g., Dopita et al. 2013). If the 
cause can be identified and characterized, then optimisti¬ 
cally the T e method and associated strong-line diagnostic 
calibrations can be updated to produce truly reliable metal¬ 
licity measurements. For now, we can at least rule out any 
strong systematic evolution with redshift for the strong-line 
diagnostics considered here. 
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